On Anomaly Ranking and Excess-Mass Curves, Supplementary Material
نویسندگان
چکیده
Let t > 0. Recall that EM∗(t) = α(t) − tλ(t) where α(t) denote the mass at level t, namely α(t) = P(f(X) ≥ t), and λ(t) denote the volume at level t, i.e. λ(t) = Leb({x, f(x) ≥ t}). For h > 0, let A(h) denote the quantity A(h) = 1/h(α(t + h) − α(t)) and B(h) = 1/h(λ(t + h) − λ(t)). It is straightforward to see that A(h) and B(h) converge when h→ 0, and expressing EM∗ ′ = α′(t)−tλ′(t)−λ(t), it suffices to show that α′(t)−tλ′(t) = 0, namely limh→0A(h)−t B(h) = 0. Now we have A(h) − t B(h) = 1 h ∫ t≤f≤t+h f − t ≤ 1 h ∫ t≤f≤t+h h = Leb(t ≤ f ≤ t+h)→ 0 because f has no flat part.
منابع مشابه
On Anomaly Ranking and Excess-Mass Curves
Learning how to rank multivariate unlabeled observations depending on their degree of abnormality/novelty is a crucial problem in a wide range of applications. In practice, it generally consists in building a real valued ”scoring” function on the feature space so as to quantify to which extent observations should be considered as abnormal. In the 1d situation, measurements are generally conside...
متن کاملHow to Evaluate the Quality of Unsupervised Anomaly Detection Algorithms?
When sufficient labeled data are available, classical criteria based on Receiver Operating Characteristic (ROC) or Precision-Recall (PR) curves can be used to compare the performance of unsupervised anomaly detection algorithms. However, in many situations, few or no data are labeled. This calls for alternative criteria one can compute on non-labeled data. In this paper, two criteria that do no...
متن کاملTitle : Adaptive Anomaly Detection using Isolation Forest
Ranking measure is of prime importance in anomaly detection tasks because it is required to rank the instances from the most anomalous to the most normal. This paper investigates the underlying assumptions and definitions used for ranking in existing anomaly detection methods; and it has three aims: First, we show evidence that the two commonly used ranking measures—distance and density—cannot ...
متن کاملSupplementary Material: Entropy Measures for Stochastic Processes with Applications in Functional Anomaly Detection
Supplementary Material: Entropy Measures for Stochastic Processes with Applications in Functional Anomaly Detection Gabriel Martos 1, ̊, Nicolás Hernández 2, Alberto Muñoz 2 and Javier M. Moguerza 3 1 Universidad de Buenos Aires and CONICET; [email protected] 2 Universidad Carlos III de Madrid; {nihernan,albmun}@est-econ.uc3m.es 3 Universidad Rey Juan Carlos; [email protected] ...
متن کاملAnomaly Ranking as Supervised Bipartite Ranking
The Mass Volume (MV) curve is a visual tool to evaluate the performance of a scoring function with regard to its capacity to rank data in the same order as the underlying density function. Anomaly ranking refers to the unsupervised learning task which consists in building a scoring function, based on unlabeled data, with a MV curve as low as possible at any point. In this paper, it is proved th...
متن کامل